Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
BMC Genomics ; 25(1): 405, 2024 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-38658835

RESUMO

Graph-based pangenome is gaining more popularity than linear pangenome because it stores more comprehensive information of variations. However, traditional linear genome browser has its own advantages, especially the tremendous resources accumulated historically. With the fast-growing number of individual genomes and their annotations available, the demand for a genome browser to visualize genome annotation for many individuals together with a graph-based pangenome is getting higher and higher. Here we report a new pangenome browser PPanG, a precise pangenome browser enabling nucleotide-level comparison of individual genome annotations together with a graph-based pangenome. Nine rice genomes with annotations were provided by default as potential references, and any individual genome can be selected as the reference. Our pangenome browser provides unprecedented insights on genome variations at different levels from base to gene, and reveals how the structures of a gene could differ for individuals. PPanG can be applied to any species with multiple individual genomes available and it is available at https://cgm.sjtu.edu.cn/PPanG .


Assuntos
Genômica , Genômica/métodos , Oryza/genética , Anotação de Sequência Molecular , Genoma de Planta , Variação Genética , Software , Navegador , Bases de Dados Genéticas , Nucleotídeos/genética , Genoma
3.
Nat Commun ; 13(1): 5412, 2022 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-36109518

RESUMO

Pangenomic study might improve the completeness of human reference genome (GRCh38) and promote precision medicine. Here, we use an automated pipeline of human pangenomic analysis to build gastric cancer pan-genome for 185 paired deep sequencing data (370 samples), and characterize the gene presence-absence variations (PAVs) at whole genome level. Genes ACOT1, GSTM1, SIGLEC14 and UGT2B17 are identified as highly absent genes in gastric cancer population. A set of genes from unaligned sequences with GRCh38 are predicted. We successfully locate one of predicted genes GC0643 on chromosome 9q34.2. Overexpression of GC0643 significantly inhibits cell growth, cell migration and invasion, cell cycle progression, and induces cell apoptosis in cancer cells. The tumor suppressor functions can be reversed by shGC0643 knockdown. The GC0643 is approved by NCBI database (GenBank: MW194843.1). Collectively, the robust pan-genome strategy provides a deeper understanding of the gene PAVs in the human cancer genome.


Assuntos
Neoplasias Gástricas , Povo Asiático/genética , China , Genoma Humano , Humanos , Lectinas/genética , Receptores de Superfície Celular/genética , Neoplasias Gástricas/genética
4.
Genome Res ; 32(5): 853-863, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-35396275

RESUMO

The concept of pan-genome, which is the collection of all genomes from a population, has shown a great potential in genomics study, especially for crop sciences. The rice pan-genome constructed from the second-generation sequencing (SGS) data is about 270 Mb larger than Nipponbare, the rice reference genome (NipRG), but it is still disadvantaged by incompleteness and loss of genomic contexts. The third-generation sequencing (TGS) with long reads can help to construct better pan-genomes. In this paper, we report a high-quality rice pan-genome construction method by introducing a series of new steps to deal with the long-read data, including unmapped sequence block filtering, redundancy removing, and sequence block elongating. Compared to NipRG, the long-read sequencing-based pan-genome constructed from 105 rice accessions, which contains 604 Mb novel sequences, is much more comprehensive than the one constructed from ∼3000 rice genomes sequenced with short reads. The repetitive sequences are the main components of novel sequences, which partially explain the differences between the pan-genomes based on TGS and SGS. Adding six wild rice accessions, there are about 879 Mb novel sequences and 19,000 novel genes in the rice pan-genome in total. In addition, we have created high-quality reference genomes for all representative rice populations, including five gapless reference genomes. This study has made significant progress in our understanding of the rice pan-genome, and this pan-genome construction method for long-read data can be applied to accelerate a broad range of genomics studies.


Assuntos
Oryza , Genoma , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Oryza/genética , Análise de Sequência de DNA
5.
J Cell Biochem ; 122(10): 1428-1434, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34132422

RESUMO

Interpreting functional analysis results derived from environmental samples using direct sequencing meta-omics data, including metagenomics and meta-transcriptomics data, is challenging due to their complexity. Visualization of functional analysis results can help researchers discover relevant biological insights. Despite the availability of many R packages, there lacks interactive and comprehensive graphic systems for displaying functional terms and corresponding genes in meta-omics analysis results. Here, we present ivTerm, an R-shiny package with a user-friendly graphical interface that enables users to inspect functional annotations, compare results across multiple experiments, create customized charts, and download these charts. It provides various basic and innovative chart types to visualize functional terms and involved genes. Users can also browse the description of terms obtained from the database web servers automatically. Two examples, including a metagenome analysis data for human gut and a meta-transcriptome data for coral symbiomes, are given to show the usage of ivTerm. In the end, we compared ivTerm with existing tools with similar functions, such as GOplot, ViSEAGO, and Chordomics. The tool ivTerm is convenient and efficient for biologists to gain an integrated view and develop deep insights by interactive analysis of meta-omics data. It can accelerate the procedure to develop insights from complex meta-omics data. The code for ivTerm is freely available at https://github.com/SJTU-CGM/ivTerm.


Assuntos
Biologia Computacional/métodos , Gráficos por Computador , Visualização de Dados , Software , Interpretação Estatística de Dados , Bases de Dados Factuais , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Genômica/métodos , Humanos , Metabolômica/métodos , Metagenoma , Transcriptoma
6.
Front Pharmacol ; 11: 1183, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32848786

RESUMO

A high serine content in body fluid was identified in a portion of patients with gastric cancer, but its biological significance was not clear. Here, we investigated the biological effect of serine on gastric cancer cells. Serine was added into the culture medium of MGC803 and HGC27 cancer cells, and its influence on multiple biological functions, such as cell growth, migration and invasion, and drug resistance was analyzed. We examined the global transcriptomic profiles in these cultured cells with high serine content. Both MGC803 and HGC27 cell lines were originated from male patients, however, their basal gene expression patterns were very different. The finding of cell differentiation-associated genes, ALPI, KRT18, TM4SF1, KRT81, A2M, MT1E, MUC16, BASP1, TUSC3, and PRSS21 in MGC803 cells suggested that this cell line was more poorly differentiated, compared to HGC27 cell line. When the serine concentration was increased to 150mg/ml in medium, the response of these two gastric cancer cell lines was different, particularly on cell growth, cell migration, and invasion and 5-FU resistance. In animal experiment, administration of high concentration of serine promoted cancer cell metastasis to local lymph node. Taken together, we characterized the basal gene expressing profiles of MGC803 and HGC27. The HGC27 cells were more differentiated than MGC803 cells. MGC803 cells were more sensitive to the change of serine content. Our results suggested that the responsiveness of cancer cells to microenvironmental change is associated with their genetic background.

7.
Gigascience ; 9(6)2020 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-32470133

RESUMO

BACKGROUND: In cancer cells, fusion genes can produce linear and chimeric fusion-circular RNAs (f-circRNAs), which are functional in gene expression regulation and implicated in malignant transformation, cancer progression, and therapeutic resistance. For specific cancers, proteins encoded by fusion transcripts have been identified as innovative therapeutic targets (e.g., EML4-ALK). Even though RNA sequencing (RNA-Seq) technologies combined with existing bioinformatics approaches have enabled researchers to systematically identify fusion transcripts, specifically detecting f-circRNAs in cells remains challenging owing to their general sparsity and low abundance in cancer cells but also owing to imperfect computational methods. RESULTS: We developed the Python-based workflow "Fcirc" to identify fusion linear and f-circRNAs from RNA-Seq data with high specificity. We applied Fcirc to 3 different types of RNA-Seq data scenarios: (i) actual synthetic spike-in RNA-Seq data, (ii) simulated RNA-Seq data, and (iii) actual cancer cell-derived RNA-Seq data. Fcirc showed significant advantages over existing methods regarding both detection accuracy (i.e., precision, recall, F-measure) and computing performance (i.e., lower runtimes). CONCLUSION: Fcirc is a powerful and comprehensive Python-based pipeline to identify linear and circular RNA transcripts from known fusion events in RNA-Seq datasets with higher accuracy and shorter computing times compared with previously published algorithms. Fcirc empowers the research community to study the biology of fusion RNAs in cancer more effectively.


Assuntos
Algoritmos , Biologia Computacional/métodos , RNA Circular/genética , RNA/genética , Software , Regulação da Expressão Gênica , Fusão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Neoplasias , Reprodutibilidade dos Testes , Fluxo de Trabalho
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA